The Design and Construction of A Chinese Collocation Bank
Abstract
This paper presents an annotated Chinese collocation bank developed at the Hong Kong Polytechnic University. First, a definition of collocation that is both linguistically consistent and computationally operable is discussed, and the properties of collocations are presented. Second, collocations are classified into four types according to the combinations of these properties. Third, the annotation guideline is presented. Fourth, implementation issues in constructing the collocation bank are addressed, including annotation with categorization, dependency, and contextual information. Currently, the collocation bank is complete for 3,643 headwords in a 5-million-word corpus.
Similar resources
Annotating Chinese Collocations with Multi Information
This paper presents the design and construction of an annotated Chinese collocation bank as a resource to support systematic research on Chinese collocations. With the help of computational tools, the bi-gram and n-gram collocations corresponding to 3,643 headwords are manually identified. Furthermore, annotations for bi-gram collocations include dependency relation, chunking relation and cla...
The Construction of a Chinese Collocational Knowledge Resource and Its Application for Second Language Acquisition
The appropriate use of collocations is a challenge for second language acquisition. However, high-quality and easily accessible Chinese collocation resources are not available to teachers and students. This paper presents the design and construction of a large-scale resource of Chinese collocational knowledge, and a web-based application (OCCA, Online Chinese Collocation Assistant) which ...
Construction of Semantic Collocation Bank Based on Semantic Dependency Parsing
Collocation has always been an important issue in language research, especially in Chinese language research. Chinese is an isolating language, which lacks morphological changes. Establishing a relatively complete dictionary of Chinese collocations will be a great contribution to Chinese study and research. Collocation plays a significant supporting role in many fields of NLP, such as informatio...
Building a Chinese Shallow Parsed TreeBank for Collocation Extraction
To automatically extract Chinese collocations and build a large-scale collocation bank, we are developing a one-million-word Chinese shallow parsed treebank. The treebank can be used not only as a training set for our shallow parser, but also as processed data from which collocations are extracted. This paper presents several issues related to this on-going project, such as our definition of sh...
A spline collocation method for integrating a class of chemical reactor equations
In this paper, we develop a quadratic spline collocation method for integrating the nonlinear partial differential equations (PDEs) of a plug flow reactor model. The method is proposed in order to be used for the operation of control design and/or numerical simulations. We first present the Crank-Nicolson method to temporally discretize the state variable. Then, we develop and analyze the pro...